🔧 Systems-level optimizations for LLM serving - pleto · Scour

Relation-Aware Bayesian Optimization of DBMS Configurations Guided by Affinity Scores

arxiv.org·1d

✨Model optimizations in LLMs

Flag this post

From product to system network challenges in system of systems lifecycle management

arxiv.org·1d

⚙️AI Infrastructure Automation

Flag this post

DialectalArabicMMLU: Benchmarking Dialectal Capabilities in Arabic and Multilingual Language Models

arxiv.org·1d

🧠Large Language Models (LLMs)

Flag this post

Advancing Explainable AI in Radiology Research with NVIDIA Clara Reason

developer.nvidia.com·1d

📊AI Performance Profiling

Flag this post

LLM-Centric RAG with Multi-Granular Indexing and Confidence Constraints

arxiv.org·1d

🧠Large Language Models (LLMs)

Flag this post

Simple Additions, Substantial Gains: Expanding Scripts, Languages, and Lineage Coverage in URIEL+

arxiv.org·1d

🧠Large Language Models (LLMs)

Flag this post

Challenges in Credit Assignment for Multi-Agent Reinforcement Learning in Open Agent Systems

arxiv.org·1d

🤖Agents using LLMs

Flag this post

Culture Cartography: Mapping the Landscape of Cultural Knowledge

arxiv.org·1d

🧠Large Language Models (LLMs)

Flag this post

Contrastive Knowledge Transfer and Robust Optimization for Secure Alignment of Large Language Models

arxiv.org·1d

🧠Large Language Models (LLMs)

Flag this post

zFLoRA: Zero-Latency Fused Low-Rank Adapters

arxiv.org·4d

🧠Large Language Models (LLMs)

Flag this post

Structurally Valid Log Generation using FSM-GFlowNets

arxiv.org·4d

🧠Large Language Models (LLMs)

Flag this post

A Quantitative Framework to Predict Wait-Time Impacts Due to AI-Triage Devices in a Multi-AI, Multi-Disease Workflow

arxiv.org·1d

📊AI Performance Profiling

Flag this post

Can MLLMs Read the Room? A Multimodal Benchmark for Verifying Truthfulness in Multi-Party Social Interactions

arxiv.org·1d

🧠Large Language Models (LLMs)

Flag this post

Reasoning Models Sometimes Output Illegible Chains of Thought

arxiv.org·1d

🧠Large Language Models (LLMs)

Flag this post

Context-Aware Stochastic Modeling of Consumer Energy Resource Aggregators in Electricity Markets

arxiv.org·1d

✨Model optimizations in LLMs

Flag this post

VISTA Score: Verification In Sequential Turn-based Assessment

arxiv.org·1d

🧠Large Language Models (LLMs)

Flag this post

StreetMath: Study of LLMs' Approximation Behaviors

arxiv.org·4d

🧠Large Language Models (LLMs)

Flag this post

SERVIMON: AI-Driven Predictive Maintenance and Real-Time Monitoring for Astronomical Observatories

arxiv.org·1d

⚙️AI Infrastructure Automation

Flag this post

Auditing LLM Editorial Bias in News Media Exposure

arxiv.org·1d

🧠Large Language Models (LLMs)

Flag this post

Spiking Neural Networks: The Future of Brain-Inspired Computing

arxiv.org·1d

⚡Real-time AI Systems

Flag this post

Loading more...